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Abstract 

Interconnection networks of parallel systems are used for servicing traf¬ 
fic generated by different applications, often belonging to different users. 
When multiple traffic flows contend for channel bandwidth, the scheduling 
algorithm regulating the access to that channel plays a key role in ensur¬ 
ing that each flow obtains the required quality of service. Fairness is a 
highly desirable property for a scheduling algorithm. We show that using 
the Relative Fairness Bound as a fairness measure may lead to decrease in 
throughput and increase in latency. We propose an alternative metric to 
evaluate the fairness and avoid the drawback of Relative Fairness Bound. 


1 Introduction 

Interconnection networks of parallel systems are used for servicing traffic 
generated by different applications, often belonging to different users. When 
multiple traffic flows contend for channel bandwidth, the scheduling algorithm 
regulating the access to that channel plays a key role in ensuring that each flow 
obtains the required quality of service. Fairness is a highly desirable property for 
a scheduling algorithm. A variety of scheduling algorithms have been proposed 
to meet the demand of fairness. We will introduce some related works herein 
and discuss another metric different from current mean to measure the fairness. 


2 Fairness measure 

In order to define the fairness, a measure called Relative Fairness Bound 
(RFB) has been proposed in [lj. M. Shreedhar et al. argue that it makes no 
sense to compare a flow that is not backlogged with one that is, because the 
former does not receive any service when it is not backlogged. 

Definition 1 A flow is backlogged during an interval I of an execution if the 
queue for flow I is never empty during interval I. 

Definition 2 Let FM(t±,t2) be the maximum, over all pairs of flows i, j that 
are backlogged in the interval (ti,t2), of {sentiltijtf)/fi — senti(t\,t2) /fj)■ De¬ 
fine FM to be the maximum value of FM(t\,t2) over all possible executions of 
the fair queuing scheme and all possible intervals (t\,t2) in an execution. 
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Definition 3 A service discipline is fair if FM is a small constant. In partic¬ 
ular, FM(ti , £2) should not depend on the size of the interval (tl,t 2 ). 

This evaluation standard is also known as Relative Fairness Bound. 


3 Related work 

3.1 Deficit Round Robin 

An algorithm for fairly servicing queues in a router call deficit round-robin 
(DRR) is designed in T|. 

They assume that the quantities /,, which indicate the share given to flow 
i, are specified as follows. They assume that each flow i is allocated Qi worth 
of bits in each round. Define Q = Mini(Qi), the share / t allocated to flow i is 
simply Qi/Q. Finally, since the algorithm works in rounds, they measure time 
in terms of rounds. A round is one round-robin iteration over the queues that 
are backlogged. 

Packets coming in on different flows are stored in different queues. Let the 
number of bytes sent out for queue i in round k be byteSi(k). Each queue i is 
allowed to send out packets in the first round subject to the restriction that 
bytesi( 1 ) < Qi. If there are no more packets in queue i after the queue has been 
serviced, a state variable called DCi is reset to zero. Otherwise, the remaining 
amount (Qi — byteSi(k )) is stored in the state variable DCi. In subsequent 
rounds, the amount of bandwidth usable by this flow is the sum of DCi of the 
previous round added to Qi. 

To avoid examining empty queues, they keep an auxiliary list ActiveList 
that is a list of indices of queues that contain at least one packet. Whenever 
a packet arrives to a previously empty queue i, i is added to the end of Ac¬ 
tiveList. Whenever index i is at the head of ActiveList, the algorithm services 
up to (Qi + DCi ) worth of bytes from queue i ; if at the end of this service 
opportunity, queue i still has packets to send, the index i is moved to the end of 
ActiveList ; otherwise, DCi is set to zero and index i is removed from ActiveList. 
Pseudo-code for this algorithm is shown in Fig. 1. Enqueue (), Dequeue () are 
standard Queue operators. They use a list of active flows, ActiveList, with 
standard operations like InsertActiveList, which adds a now index to the end of 
the active list. FreeBuffer () frees a butter from the flow with the longest queue 
using McKinneys buffer stealing. Qi is the quantum allocated to Queuet. DCi 
contains the bytes that Queuei did not use in the previous round. 

3.2 Elastic Round Robin 

DRR is a simple, fair, efficient and easily implementable scheduling disci¬ 
pline, while the information of packet size is needed. However, when a scheduler 
uses the size of a packet to make its decision, it cannot be readily adapted to 
the unique requirements of wormhole switching. Elastic Round Robin [2] satisfy 
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Initialization: 

For («' = 0;«’ < n; i = t + 1) 

DCi = 0 ; 

Enqueuing module: on arrival of packet p 
i = ExtractFlow(p) 

If ( ExistsInActiveList(i ) == FALSE ) then 
InsertActiveList(i)\ (*add i to active list*) 
DCi = 0 ; 

If no free buffers left then 

FreeBuf fer(); (* using buffer stealing *) 
Enqueue(i,p ); (* enqueue packet p to queue i*) 

Dequeuing module: 

While(TRUE) do 

If ActiveList is not empty then 

Remove head of ActiveList , say flow i 
DQ = Q, + DCi ; 
while {(DCi > 0) and 

(Queue; not empty)) do 
PacketSize - S ize(H ead(Queue t ))\ 

If (PacketSize < DCi) then 
Send( Dequeue(Queuei)) ; 

DC, = DC, 

— PacketSize ; 

Else break; (*skip while loop *) 

If (Empty (Queue;)) then 
DCi = 0; 

Else Insert Active List (i)\ 

Figure 1: Pseudo-code for DRR 


the need of fairness without the knowledge of packet size, therefore it can be 
used in both Internet routers and wormhole switches. 

Consider n flows, each with an associated queue with packets in it. The 
scheduler dequeues packets from these queues according to a scheduling disci¬ 
pline and forwards them for transmission over an output link. As in traditional 
scheduling problems, They allow that the length of time it takes to dequeue 
a packet is proportional to the size of the packet-however, to apply this work 
to wormhole networks, it is required that the scheduling algorithm not make 
any assumptions about the length of a packet prior to completely transmitting 
the packet. The ERR scheduler meets the above requirement. A pseudo-code 
implementation of the ERR scheduling algorithm is shown in Fig. 2, consisting 
of Initialize, Enqueue, and Dequeue routines. The Enqueue routine is called 
whenever a new packet arrives at a flow. The Dequeue routine is the heart of 
the algorithm which schedules packets from the queues corresponding to differ¬ 
ent flows. In their paper, they use a flit as the smallest piece of a packet that 
can be independently scheduled and they measure the length of a packet in flits. 
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Initialize'. (Invoked when the scheduler is initialized) 
RoundRobinVisitCount = 0; 

PreviousMaxSC — 0; 

for (i = 0; i < n; t = i + 1) 

SC, - 0; 

Enqueue: (Invoked when a packet arrives) 
i = QueueinWhichPackei Arrives: 
if (ExistslnAdiveList(i) == FALSE) then 
A ddToA diveListf i )\ 

I ncreinent SizeOfAcliveList ; 

SC, = 0; 
end if: 

Dequeue: 

while (TRUE) do 

if (RoundRobinVisitCount == 0) then 
PreviousMaxSC = MaxSC: 

RoundRobinVisitCount = SizeOfAcliveList ; 

MaxSC = 0; 

end if: 

i — HeadOfActiveList: 

RemoveHeadOfA dive List: 

.4* = 1 + PreviousMaxSC — SC,\ 

Senti = 0; 

do 

Transmit Packet EroniQuette(i); 

Increase Sent, by LengthlnFlitsOfTransmittedPacket ; 
while (Senti < A,); 

SC, = Senti — A;; 
if (SC, > MaxSC ) then 
MaxSC = SC,\ 
end if: 

if (QueuelsEmpty -- FALSE) then 
A ddQueueToA dive List (i): 

else 

SCi = 0 : 

Decrement SizeOJA dive List: 

end if: 

Decrement RoundRobinVisitCount: 

end while: 

Figure 2: Pseudo-code for ERR 


They maintain a linked list, called the ActiveList, of flows which are active. 
A flow whose queue was previously empty and therefore not in the ActiveList is 
added to the tail of the list whenever a new packet belonging to the flow arrives. 
The ERR scheduler serves the flow i at the head of this list. After serving flow 
i, if the queue of flow i becomes empty, it is removed from the list. On the other 
hand, if the queue of flow i is not empty after it has received its round-robin 
service opportunity, flow i is added back to the tail end of the list. 
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Consider the instant of time, t\ . when the scheduler is first initialized. They 
define Round 1 as one round-robin iteration starting at time t\ and consisting 
of visits to all the flows that were in the ActiveList at time t\. They illustrate 
this definition of a round using Fig. 3a. Assume that flows A, B, and C are 
the only flows active at the beginning of Round 1. The visits of the scheduler 
to flows A, B, and C, comprise Round 1. Let flow D become active after the 
time instant t\, but before the completion of Round 1. Let the time instant t-2 
mark the completion of Round 1. The scheduler does not visit flow D in Round 

1 since D was not in the ActiveList at the start of Round 1. Round 2 is now 
defined as consisting of the visits to all of the flows that are in the ActiveList 
at time £ 2 • Assuming that flows A, B, and C are still active at time t- 2 , Round 

2 will consist of visits to the flows A, B, C, and D. 


Round 1 


Round 2 


Flows Served: A, B, C 


Flows Served: A, B, C, D 


Flow D 

Becomes Active 


(a) 


Round 1 


Round 2 


Round 3 


Flow 0, A = 
Flow 1, A = 
Flow 2, A = 
Flow 0, A = 
Flow 1, A = 
Flow 2, A = 
Flow 0, A = 
Flow 1, A = 
Flow 2, A = 



SC = 31 


(b) 


Figure 3: (a) Definition of a Round and (b) an illustration of three rounds in 
an ERR execution 


In general, they define round i recursively as the set of visits to all the flows 
in the ActiveList at the instant round (i - 1) is completed. In order that the 
scheduler knows the number of flows it has to visit in any given round, they 
introduce the quantity RoundRobinVisitCount which denotes the number of 
flows that are in the ActiveList at the start of a round. RoundRobinVisitCount 
is decremented by one after each flow is served and, when it eventually equals 
zero, it implies the end of a round. 
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In each roundthe scheduling algorithm determines the number of flits that a 
flow is allowed to send. They call this quantity the allowance for the flow during 
that round. The allowance assigned to flow i during round r is denoted by Aj(r). 
This allowance, however, is not a rigid one and is actually elastic in that a flow 
may be allowed to send more flits in a round than its allowance. Let senti(r) be 
the number of flits that are transmitted from the queue of flow i in round r. The 
ERR scheduler will begin serving the next packet from the queue if the total 
number of flits transmitted by the flow so far in the current round is less than its 
allowance. The ERR scheduler, thus, makes the scheduling decision without any 
knowledge about the packet length. Note that the last packet transmitted by 
a flow may cause it to exceed its allowance, as can happen when the allowance 
is smaller than the size of the packet at the head of the corresponding queue. 
When a flow ends up sending more than its allowance, it is interpreted as having 
obtained more than its fair share of the bandwidth. The scheduler records this 
unfairness in the Surplus Count (SC) associated with each flow. The surplus 
count, during any round, is the number of flits the flow sent in addition to its 
allowance. Let SCi(r) denote the surplus count of flow i in round r. Then, after 
serving flow i in round r, the scheduler computes SCi(r) as follows: 

SCi(r) = Senti(r) - Ai(r) ( 1 ) 

Let MaxSC(r) denote the largest surplus count among all the flows served during 
round r. This quantity is used to recursively compute the allowances for each 
of the flows in the next round, using the following equation: 

A i (r) = l + MaxSC(r-l)-SC i (r-l) (2) 

Note that, for the flow with the largest surplus count in the previous round, 
the new allowance is 1. This is ensured by the addition of 1 in (2) so that the 
scheduler will transmit at least one packet from this flow during the next round. 
The allowance given to each of the flows in a given round is not fixed and is 
computed depending on the behavior of the flows in the previous round. After 
the ERR scheduler serves flow i, if the queue of flow i is empty, its surplus count 
is reset to zero and it is removed from the ActiveList. Otherwise, if flow i has 
packets in its queue that are ready for transmission, it is added back at the tail 
end of the list. 

Fig. 3b illustrates the first three rounds in an execution of the ERR schedul¬ 
ing discipline. In this figure, at the beginning of the first of these rounds, the 
surplus counts for all three flows and the MaxSC are all initialized to 0. Thus, 
from (2), the allowance during round 1 is equal to 1 for all the flows. The sizes 
of the packets actually sent by the flow during this round are shown by the 
horizontal bars and the new allowances for the next round are again computed 
using (1) and (2). It is easily observed from the figure that, in general, flows 
which receive very little service in a round are given an opportunity to receive 
proportionately more service in the next round. 
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3.3 Eligibility-Based Round Robin 

The unfairness of ERR is due to the fact that surplus counters are reset 
as soon as flows become idle. This implies forgetting about any excess service 
received by idling flows, which are allowed to compete for the channel bandwidth 
again as early as in the subsequent round. The Eligibility-Based Round Robin 
(EBRR) scheduling algorithm is introduced by [3]. This approach avoids the 
forgetting flaw of ERR. 


Flowlnit (flow i ) { 

C t — 

EC t = RC; 

) 

PacketArrival (packet a , flow t) { 
if (ldle( x )) { 

if ( RC > EC t ) { 

C t 

EC i = RC ; 

} 

EnqueueFloM, i , ActiveList{ EC, mod q )); 

} 

EnqueuePacket^ a , PacketQueue( i )); 

} 


EBRRScheduler{) { 
while (SystemBusy) { 

w hile (NotEmpty{ActiveList{ RC mod q)) { 
i ■ DequeueFlowiActiveLLsti RC mod q )); 
a = DequeueHeadPacket{ i ); 

Trans mi tPaeket{ a ); 

Ct = C t - Lenglhi a ); 
if (C, <0)( 

ECt = RC +1 + floor( —c t /$*); 

Ct-Ct +(1 + floorf-C,/^))- A; 

1 

if (Active^ i )) 

EnqueueFlow( i, AcriveLisi( EC, mod ■:)); 

} 

RC =NewRound(); 

) 

) 


Figure 4: Pseudo-code for EBRR 

A pseudo-code implementation of the EBRR scheduling algorithm is shown 
in Fig. 4, consisting of Flowlnit, PacketArrival and EBRRScheduler routines. 

First of all, whenever an eligible flow is selected for service, instead of trans¬ 
mitting multiple packets back-to-back from that flow until either it is no longer 
eligible or the flow queue is emptied out, the flow is allowed to transmit one 
packet only. After that, if the flow is still eligible, it is inserted back into the 
current list so that every other eligible flow (if any) is allowed to transmit a 
packet before the flow can transmit again. 

whenever a flow i becomes active after a packet arrival at an empty queue, 
the list into which the flow is inserted depends on how much surplus service, 
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if any, the flow received the last time that it was active and completed the 
transmission of packets in its queue. The Flowlnit function is invoked whenever 
a new flow is activated. It sets its credit to the quantum and its eligibility 
counter to the number of the current round, that is, the activating flow will be 
eligible as soon as it becomes backlogged. The Packet Arrival function is invoked 
whenever a new packet arrives at the scheduler. If the flow was previously idle, 
it is now backlogged and has to be inserted into one of the active lists. 

EBRRScheduler is invoked whenever there are packets to be scheduled. As 
long as the current list is not empty, the head-of-line flow i is selected for service. 
The head packet of flow i is dequeued and transmitted, and its length (after the 
packet transmission) is subtracted from flow is credit. When the current list is 
empty, a new round is started. 

3.4 Probabilistic Distance-based Arbitration 

Although round-robin arbitration provides local fairness across all routers, it 
does not provide any global fairness across all routers. As traffic flows through 
the network, it merges with newly injected packets and traffic from other di¬ 
rections in the network. This merging of traffic from different sources causes 
packets that have further to travel (more hops) to receive geometrically less 
bandwidth. As shown in Fig. 5, consider the 8-ary 1-mesh in Figure 5 where 
processors PO thru P6 are sending to P7. The switch allocates the output port 
by granting packets fairly among the input ports. With a round-robin arbitra¬ 
tion policy, the processor closest to the destination (P6 is only one hop away) 
will get the most bandwidth - 1/2 of the available bandwidth. The processor 

two hops away, P5, will get half of the bandwidth into router R6, for a total 
of 1/2 x 1/2 = 1/4 of the available bandwidth. That is, every two arbitration 
cycles P7 will deliver a packet from source P6, and every four arbitration cycles 
it will deliver a packet from source P5. As a result, PO and PI each receive only 
1/64 of the available bandwidth into P7, a factor of 32 times less than that of 
P6. Age-based arbitration [4] is known to provide global fairness as when two 



Figure 5: 8-ary 1-mesh example where all nodes are sending to P7 and merging 
traffic at each hop 

or more packets arbitrate for a shared resource, the packet with the oldest age 
wins the arbitration. However, it become complex to implement in an on-chip 
network constraint. EoS [5] is proposed to provide global fairness without any 


































significant complexity and relying only on local arbitration. EoS uses proba¬ 
bilistic distance based arbitration, thus avoiding the complexity with age-based 
arbitration by proposing to approximate the age of a packet with distance or 
hop count. By using information already present in the packet, such as source 
node, current node, or destination node and using distance as a proxy for the 
packets age, age-based arbitration is greatly simplified. 

By using priority based on hop count, livelock and fairness issues are prob¬ 
lematic because packets with a lower priority (i.e., a lower hop count) can con¬ 
tinually lose arbitration because of a constant stream of newly injected traffic 
with higher priority. To overcome this problem while still using hop count as 
the weight, EoS propose probabilistic arbitration where the output of the arbi¬ 
tration is probabilistically determined based on the weight of the input requests. 
Assume an arbiter with m requests, for each requests ri with a corresponding 
weight Wi. The probability of r, being granted is 


P(9i) 


Wi 


v ^j=m 
^ 3 =1 


Wj 


( 3 ) 


As shown earlier in Figure 5, nodes that are farther away are serviced at a 
rate that is exponentially proportional to the hop count. To account for this 
difference, EoS introduce nonlinear weights based on the distance. Instead of 
using a weight which is equal to the hop count (i.e., w = h), they introduce 
nonlinear weights in probabilistic arbitration , i.e. w = C h where C is the 
contention degree or the number of packets contending for the same output 
port. By using nonlinear weights, better fairness is provided for nodes that are 
farther away. 

Three arbitration metrics are designed in EoS, with C and h to be static or 
dynamic. 

1) Fix weight. The total number of hops a packet must travel from its source 
to its destination is a static value in a mesh network with minimal routing. 
The exponential h is set to be distance from source to destination, and C is 
set to be max contention degree in each router. 

2) Constantly Increasing Weight, the exponential h is set to be the distance 
a packet has traversed, and C is set to be max contention degree in each 
router. 


3) Variably Increasing Weight (VW). Instead of assuming a constant C value at 
each hop in each dimension, C is set dynamically to be the actual contention 
degree. 

On hotspot traffic where all nodes send traffic to a single destination, EoS match 
the performance of age-based arbitration in hotspot traffic and achieve equality 
of service. 
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4 Motivation 


4.1 The drawback of RFB 

In wormhole switch networks, the contention resources refer to the buffer, 
channel and crossbar. All the algorithms mentioned above measure the fairness 
based on the allocation of crossbar as the total length of packets forwarded in 
each flow is nearly identical. While contention and congestion always exist in the 
network, knowing the size of a packet does not imply we have the information 
that how long it will hold the channel, a.k.a. service time, since the latter might 
include blocking time. 

RFB merely consider the metric of crossbar fairness, which may result in 
the decrease in the throughput of the network. We will conceive an instance 
to illuminate this side-effect. Assume we have two flows A and B, and A is 
congested. As the granularity of scheduling is packet, when A is granted, it will 
cost a host of cycles to forward this whole packet and worsen the congestion 
at the same time. Then B is to some extent a victim as its latency soars due 
to As blocking time. By definition, in the next round the grant will allocate 
to A and lead to a vicious circle. However, this situation could be mitigated 
significantly if we take the congestion of flow A into account. When we perceive 
the service time is much longer than its packet size, we assert that this flow face 
the congestion. We will reduce the priority of this flow and give more chances 
for the other free-congest ion flows. Evidently the schedule violates RFB and is 
classified into unfair algorithm, while it is really reasonable and has the ability 
to achieve high throughput and low latency. 

4.2 Channel Fairness Bound 

The drawback of RFB is that it only takes crossbar resource as the metric 
and ignores the occupation time of the channel. We propose a novel evaluation 
Channel Fairness Bound (CFB) to evaluate the fairness of packet scheduling 
algorithm in a bid to evade the flaw of RFB. RFB merely comprises sending 
time, i.e., the packet size, while CFB consists of sending time and blocking time. 
There are two methods to make use of blocking time. We can still try to follow 
the evaluation standard of RFB to keep the sending time fair. However, we 
should set a threshold to detect the congestion and further modify the priority 
or the round-robin pointer. When the service time of a certain flow is over 
the threshold, it means congestion occurs and it is unadvisable to allocate the 
resources as usual. Free-congestion flows tend to receive the grants. But this 
modification is temporary. We presume this traffic jam will relieve significantly 
in the next several round and the flow can restore its previous status. It is 
worth noting that during the time the flow is degraded, the allocated crossbar 
resources go down accordingly. Whether a compensation is necessary depends 
the concrete algorithm. 

Another way is to measure fairness with sending time and blocking time 
instead of simple sending time. As briefly introduced above, DRR, ERR and 
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EBRR can easily fulfil the requirement of CFB. We just replace the packet size 
with channel occupation time and the fairness is the same as in the RFB. While 
as J. Kim et al. j5] point out, round-robin arbitration provides local fairness 
across all routers, it does not provide any global fairness across all routers. It is 
likewise a challenge if we adopt CFB. Here we will use the similar method with 
probabilistic distance based arbitration to achieve global fairness. 

4.3 Probabilistic Arbitration 

For convenience, we derive the theoretical feasibility of CFB to achieve global 
fairness on hotspot traffic where all nodes send traffic to a single destination. 
Probabilistic distance based arbitration guarantees equality of service by mea¬ 
suring the accepted throughput across all nodes. The metric of fairness is still 
RFB. Similarly, we will test whether this algorithm is capable to provide global 
fairness on the basis of CFB. 

A hotspot traffic example is shown in Fig. 6 where processors PO through 
P6 are sending data to P7. Generally speaking, the packet size is fixed in a 
specific network. So let L be the length of all packets. 



Figure 6: 8-ary 1-mesh example where all nodes are sending to P7 

We define some notations here. Let be the VC service time which router 
j offers to flow i , be the VC service time that router j offers to packet m 
of flow i , kij be the number of packets in flow i going through router j. By 
definition, we have T,-j = l We denote by Uj the average VC service 

time a packet in flow i is allocated. Then there is 

Tij = kijtij (4) 

From equation (4), we know that to ensure equality on VC service time for 
each flow, kij should be in inverse proportion to tij, i.e., the number of packets 
in flow i should be in inverse proportion to VC service time router j offers to a 
packet in flow i. As in probabilistic distance based arbitration, the weight of each 
flow determines kij, therefore the only problem to satisfy CFB is to attain the 
information of and further adjust k,j accordingly. This method can ensure 
that the channel occupation time in the destination node for each flow is nearly 
the same, which is one of the metric of CFB. While probabilistic distance based 
arbitration also achieves the global fairness under RFB in each node besides 
the destination, we will explore whether this scheduling can guarantee fairness 
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under CFB. Take W ( ij ) as the weight of flow i in router j , Rj(i) as the number 
of packets j accepts from flow i , and n as the number of flows router j accepts. 
By definition, there is 

R j =R j (0):R j (l):R j (2):.--:R j (n) (5) 

Trivially, we can obtain 

R 1 = W( 01) : W{ 11) (6) 


Consider R0 and R1 in Figure 6. Flow 0 consists of packets whose source 
is P0. When flow 0 arrives at Rl, its weight is W{ 01). When flow 1 arrives at 
Rl, its weight is VF(ll). According to the relation of weight and the number of 
packets described above, there is equation (6). By the same token, we can get 

R 2 = W{ 01) : W( 11) : (^jvF(Ol) + ^|w(ll)) (7) 

Consider R0, Rl and R2 in Figure 6. When flow 0 and 1 arrives at Rl, 
the weight proportion is (W(01), W(ll)) . Then flow 2 arrives at router 2. 
To maintain the weight proportion of flow 0, 1, 2 at router 2, when flow 0 sent 
W(01) packets and flow 1 sent W(ll) packets, number of packets flow 2 can send 
should be W(Ol) + j^^j w(ll). Then there is equation (7) In general, 
we have 


R x = W{ 01) : W{ll) 


j -1 


R > = : £ 


0 


W(ij) 


( 8 ) 


For simplicity, let W(jj) = 1. So equation is then simplified as follows 


j- 1 i 

R o = R j -1 : X! w(ij) Rj ~ ^ ^ 

We observe that packets that go through router j-1 will also go through 
router j. Assume that the service time for all flows in each router is under 
equality of service. We denote by Sij the ratio of channel occupation time to 
sending time. With Ri = W(01) : W{ 11), we have 

1^(01)5(01) = W (11)5(11) (10) 

In a similar way, we can get 

tu(01) s(12) 

w(ll) ~ s{ 02) 

Therefore, we can obtain the eciuality without W 

s(ll) s(12) 
s(01) " s(02) 


( 11 ) 

( 12 ) 
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In general, we have 


( 13 ) 


S(mk) S(mt) 

S(nk) S(nt) 

In wormhole switching networks, restriction (13) cannot be satisfied. Thereby 
probabilistic distance based arbitration cannot achieve fairness under CFB. 
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